Natural Language Description of Video Streams Using Task-Specific Feature Encoding
نویسندگان
چکیده
منابع مشابه
Natural language descriptions for video streams
Digital images and videos collection has increased exponentially in the recent years as more and more data is available in the form of personal photo albums, handheld camera videos, feature films and multilingual broadcast news videos, presenting visual data ranging from unstructured to highly structured. Today video data accounts for 80 percent of all network traffic. There is a need for quali...
متن کاملA framework for creating natural language descriptions of video streams
This contribution addresses generation of natural language descriptions for important visual content present in video streams. The work starts with implementation of conventional image processing techniques to extract high-level visual features such as humans and their activities. These features are converted into natural language descriptions using a template-based approach built on a context ...
متن کاملNatural Language Descriptions for Human Activities in Video Streams
There has been continuous growth in the volume and ubiquity of video material. It has become essential to define video semantics in order to aid the searchability and retrieval of this data. We present a framework that produces textual descriptions of video, based on the visual semantic content. Detected action classes rendered as verbs, participant objects converted to noun phrases, visual pro...
متن کاملAction Recognition Using Hybrid Feature Descriptor and VLAD Video Encoding
Human action recognition in video has found widespread applications in many fields. However, this task is still facing many challenges due to the existence of intra-class diversity and inter-class overlaps among different action categories. The key trick of action recognition lies in the extraction of more comprehensive features to cover the action, as well as a compact and discriminative video...
متن کاملBidirectional Natural Language Parsing using Streams and Counterstreams
This thesis investigates the bidirectional exchange of information between linguistic and non-linguistic semantic inputs containing ambiguities. Such exchange is critical to Cognitively Complete Systems, in which collections of related representations and processes cooperate for their mutual problem-solving benefit. The exchange paradigm of reconciliation is defined, in which ambiguities and ga...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2018
ISSN: 2169-3536
DOI: 10.1109/access.2018.2814075